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Amendments to the Claims: 

This listing of clainis will replace all prior versions, and listings, of claims in the 
application. 

Listing ftfq^ims: 

1 . (carrently amended) A method for balancing the load of a parallel processing system 
having a plurality of parallel processing elements arranged in a loop, wherein each processing 
element (JPEr) has a local number of tasks associated therewith, wherein r represents the number 
for a selected processing element, and wherein each of said processing elements is operable to 
commimicate with a clockwise adjacent processing element and with an anti-clockwise adjacent 
processing element, the method comprising: 

determining within each processing element a total number of tasks present within 
said loop; 

calculating a local mean number of tasks within each of said plurality of processing 
elements; 

calculating a local deviation from said local mean number within each of said 
plurality of processing elements; 

determining a sum deviation from said local deviations within each of said processing 
elements for one-half said loop in an anti-clockwise direction, said one-half of said loop 
being relative to each of said selected processing elements; 

determining a sum deviation from said local deviations within each of said processing 
elements for one-half said loop in a clockvsdse direction, said one-half of said loop being 
relative to each of said selected processing elements; 

detennining a clockwise transfer parameter and an anti-clockwise transfer parameter 
from said sum deviations within each of said processing elements; and 

redistributing tasks among said plurality of processing elements using in r e sponse to 
said clockwise transfer parameters and said anti-clockwise parameters within each of said 
plurality of processing elements. 
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2. (original) The method of claim 1 wherein said determining within each of said 
processing elements a total number of tasks present within said loop, comprises: 

transmitting said local number of tasks associated with each of said processing 
elements to each other of said plurality of processing elements within said loop; 

receiving within each of said processing elements said number of local tasks 
associated with said each other of said plurality of processing elements; and 

summing said number of local tasks associated with each of said processing elements 
with said number of local tasks associated with each other of said plurality of processing 
elements. 

3, (original) The method of claim 1 wherein said determining said total number of tasks 
present within said loop includes solving the equation V = ^ v, , where N represents the number . 

of processing elements in said loop, and represents said local number of tasks associated with 
an i'^ processing element in said loop. 

4. (previously presented) The method of claim 1 wherein said calculating a local mean 
number of tasks within each of said plurality of processing elements includes solving the 
equation = Trunc((y + £^ ) / jV) , where is said local mean for PEr» N is the total number 

of processing elements in said loop, and Er is a number in the range of 0 to (N-l\ V is the total 
number of tasks, and wherein each processing element has a different Er value. 

5, (previously presented) The method of claim 4 wherein said Trunc function is responsive 
to the value of Er such that said total number of tasks for said loop is equal to the sum of the 
local mean number of tasks for each of said plurality of processing elements in said loop (i.e.. 
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6. (currently amended) The method of claim 4 wherein said local mean 

M, = Trunc{{V -^-E^)! N) for each local PEr within said loop is equal to either X or (AT+l), and 
Er is a number in the range of 0 to (N-V) and X represents the local mean . 

7. (original) The method of claim 1 wherein said calculating a local deviation within each 
of said plurality of processing elements, comprises finding the difference between said local 
number of tasks and said local mean number for each of said plurality of processing elements. 

8. (original) The method of claim 1 wherein said determining a sum deviation within each 
of said processing elements for one-half of said loop in an anti-clockwise direction comprises: 

transmitting said local deviation associated with each of said processing elements half 
way around said loop in an anti-clockwise direction, said one-half of said loop being 
relative to each of said selected processing elements; 

receiving said local deviation associated with each other of said processing elements v ^ 1 

half way around said loop in a clockwise direction, said one-half of said loop being 
relative to each of said selected processing elements; and 

summing said local deviations associated with each other of said processing elements 
half way around said loop in a clockwise direction, 

9. (original) The method of claim 1 wherein said determining a sum deviation within each 
of said processing elements in one-half of said loop in a clockwise direction comprises; 

transmitting said local deviation associated with each of said processing elements half 
way around said loop in an clockwise direction^ said one-half of said loop being relative 
to each of said selected processing elements; 

receiving said local deviation associated with each other of said processing elements 
halfway around said loop in a anti-clockwise direction, said one-half of said loop being 
relative to each of said selected processing elements; and 

sumiTung said local deviations associated with each other of said processing elements 
half way around said loop in an anti-clockwise direction. 
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10. (previously presented) The method of claim 1 wherein said determining a clockwise 
transfer parameter and an anti-clockwise transfer parameter within each of said processing 
elements comprises: 

setting the clockwise transfer parameter equal to (2iS' A-C)-^A\ and 
setting the anti-clockwise transfer parameter equal to {2S + C-^) 4, where S 
represents said local deviation of a selected processing element; C represents said sum 
deviation in a clockwise half of loop relative to said selected processing element, and A 
represents said sum deviation in an anti-clockwise half of loop relative to said selected 
processing element 

11. (previously presented) The method of claim 1 wherein said determining a clockwise 
transfer parameter To and an anti-clockwise transfer parameter Tg within each of said processing 
elements comprises at least one of: 

setting the clockwise transfer parameter equal lo Trunc[Q,S + A) 4J and setting the 
, : anti-clock\yise transfer parameter equal to 5 - Tc and 

setting the anti-clockwise transfer parameter equal to Tninc[1i!2S - A) ^ 4] and setting 
the clockwise transfer parameter equal \jaS'Ta\ 
where Mag ^ abs(25), S represents the local deviation of a selected processing element, A 
represents the number of tasks passing through the current processing element, whereby if A > 
Mag then set A equal to Mag and if A < -Mag, then set A equal to -Mag. 

12. (previously presented) A method for reassigning tasks among an odd numbered plurality 
of processing elements within a parallel processing system, said processing elements being 
connected in a loop and each having a local number of tasks associated therewith, the method 
comprising: 

determining the total number of tasks on said loop; 

computing a local mean value for a selected processing element; 

computing a local deviation for said selected processing element, said local deviation 
representative of the difference between said local number of tasks for said selected 
processing element and said local mean value for said selected processing element; 

Pages of 10 

W)-n70380\'l 

PA6E8f15'RCVDAT1/16j20085:27:30PM[EastimS^^^ 



JAN 16 2008 17:30 FR JONES DPY PGH 47960 4123947959 TO 1502203061 520 ltt9 P. 09/15 



Appl. No. 10/689,336 

Supplemental Amdt^ dated 16 January 2008 

inserting a phantom processing element within said loop having a local deviation of 
zero when the loop is comprised of an odd number of processing elements; 

summing said local deviations of said processing elements located within one-half of 
the loop in an anti -clockwise direction relative to said selected processing element; 

summing said local deviations of said processing elements located within one-half of 
the loop in a clockwise direction relative to said selected processing element; 

computing a number of tasks to transfer in a clockwise direction for said selected 
processing element in response to said summing of said local deviations; 

computing a number of tasks to transfer in an anti-clockwise direction for said 
selected processing element in response to said summing of said local deviations; and 

reassigning tasks relative to the said number of tasks to transfer in a clockwise 
direction and said number of task to transfer in an anti-clockwise direction. 

13. (original) The method of claim 12 wherein said determining the total number of tasks on \ 
said loop, comprises: 

transmitting said local number of tasks associated with each of said processing 
elements to each other of said plurality of processing elements within said loop; 

receiving within each of said processing elements said number of local tasks 
associated with said each other of said plurality of processing elements; and 

summing said number of local tasks associated with each of said processing elements 
with said number of local tasks associated with each other of said plurality of processing 
elements. 

14. (previously presented) The method of claim 1 2 wherein computing a local mean value 
for a selected processing element includes solving Ihe equation - Trunc((V + Ej.)/ N)^ 
where Mr is said local mean for a processing element PE^, N is the total number of processing 
elements in said loop, V\s the total number of tasks, and Er is a nmnber in the range of 0 to (7^- 
1). 
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15. (previously presented) The method of claim 14 wherein said Trunc function is 
responsive to the value of Er such that said total number of tasks for said loop is equal to the sum 
of the local mean number of tasks for each of said plurality of processing elements in said loop 
and wherein each processing element has a different Er value assigned. 

1 6. (previously presented) The method of claim 1 2 wherein said inserting a phantom 
processing element within said loop further comprises: 

locating said phantom processing clement in a position within said loop that is 
diametrically opposed to said processing element. 

17. (original) The method of claim 12 wherein said computing a local mean value for a 
s^le^.|^ processing element, said computing a local deviation for said selected processing 
element, said inserting a phantom processing element within said loop, said summing said 
deviation of said pfocessiiig elcmehts located within one-half of the loop in an anti-clockwise 
direction, summing said deviation of said processing elements located within one-half of the loop 
in a clockwise direction^ computinjg a number of tasks to transfer in a clockwise direction for 
said selected processing element, computing a number of tasks to transfer in an anti-clockwise 
direction for said selected processing element, and reassigning tasks relative to the said number 
of task to transfer in a clockwise direction and said number of tasks to transfer in an anti- 
clockwise direction are completed simultaneously for each of said plwality of processing 
elements within said loop. 

18. (original) The method of claim 12 wherein said summing said deviation of said 
processing elements located within one-half of the loop in an anti-clockwise direction relative to 
said selected processing element comprises: 

transmiuing said local deviation associated with each of said processing elements 
halfway around said loop in an anti-clockwise direction, said one-half of said loop being 
relative to each of said selected processing elements; 

receiving said local deviation associated with each other of said processing elements 
halfway around said loop in a clockwise direction, said one-half of said loop being 
relative to each of said selected processing elements; and 
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summing said local deviations associated with each other of said processing elements 
half way around said loop in a clockwise direction, 

19- (original) The method of claim 12 wherein summing said deviation of said processing 
elements located within one-half of the loop in a clockwise direction relative to said selected 
processing element comprises: 

transmitting said local deviation associated with each of said processing elements half 
way around said loop in an clockwise direction, said one-half of said loop being relative 
to each of said selected processing elements; 

receiving said local deviation associated with each other of said processing elements 
half way around said loop in a anti-clockwise direction, said one-half of said loop being 
relative to each of said selected processing elements; and 

summing said local deviations associated with each other of said processing elements 
half way around said loop in an antirclockwise direction. 

20» (currently amended) A computer storage media r^able m e mory device carrying a set of 
instructions which, when executed, perform a method comprising: 

determining within each processing element a total number of tasks present within 
said loop; 

calculating a local mean number of tasks within each of said plurality of processing 
elements; 

calculating a local deviation from said local mean number within each of said 
plurality of processing elements; 

determining a sum deviation from said local deviations witliin each of said processing 
elements for one-half said loop in an anti-clockwise direction, said one-half of said loop 
being relative to each of said selected processing elements; 

determining a siun deviation from said local deviations within each of said processing 
elements for one-half said loop in a clockwise direction, said one-half of said loop being 
relative to each of said selected processing elements; 

determining a clockwise transfer parameter and an anti -clockwise transfer parameter 
from said sum deviations within each of said processing elements; and 
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redistributing tasks among said plurality of processing elements in response to said 
clockwise transfer parameters and said anti-clockwise parameters within each of said 
plurality of processing elements. 
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